Name | Version | Summary | date |
zenoml-next |
0.6.6 |
Upkeeping the now archived Zeno, the AI Data Management & Evaluation Platform |
2024-12-21 12:48:58 |
judges |
0.0.5 |
A small library of research-backed LLM judges |
2024-12-20 19:56:40 |
latenpy |
0.0.1 |
A package for lazy evaluation and caching to optimize scientific analysis workflows. |
2024-12-20 16:43:55 |
python-lilypad |
0.0.10 |
An open-source prompt engineering framework. |
2024-12-20 01:52:59 |
langsmith |
0.2.4 |
Client library to connect to the LangSmith LLM Tracing and Evaluation Platform. |
2024-12-19 01:48:42 |
dyff |
0.30.0 |
Meta-package to install the local SDK for the Dyff AI auditing platform. |
2024-12-19 00:58:32 |
dyff-audit |
0.10.1 |
Audit tools for the Dyff AI auditing platform. |
2024-12-18 23:55:04 |
dyff-client |
0.15.0 |
Python client for the Dyff AI auditing platform. |
2024-12-18 18:20:12 |
dyff-schema |
0.20.0 |
Data models for the Dyff AI auditing platform. |
2024-12-18 03:28:57 |
evalscope |
0.8.1 |
EvalScope: Lightweight LLMs Evaluation Framework |
2024-12-17 12:09:21 |
trajectopy |
2.1.3 |
Trajectory Evaluation in Python |
2024-12-16 17:07:25 |
ms-opencompass |
0.1.5 |
A lightweight toolkit for evaluating LLMs based on OpenCompass. |
2024-12-16 08:05:22 |
pyclustkit |
0.1.0a2 |
A Python library for clustering operations. Evaluation and meta-feature generation. |
2024-12-14 16:46:34 |
maihem |
1.7.1 |
LLM evaluations and synthetic data generation with the MAIHEM models |
2024-12-13 23:07:11 |
tieval |
0.1.4 |
A framework for evaluation and development of temporal-aware models. |
2024-12-13 21:24:53 |
quotientai |
0.1.3 |
CLI for evaluating large language models with Quotient |
2024-12-13 19:38:06 |
langcheck |
0.9.0 |
Simple, Pythonic building blocks to evaluate LLM-based applications |
2024-12-12 01:41:29 |
evalica |
0.3.2 |
Evalica, your favourite evaluation toolkit. |
2024-12-11 21:19:10 |
agenta |
0.29.0 |
The SDK for agenta is an open-source LLMOps platform. |
2024-12-11 11:02:38 |
tno.sdg.tabular.eval.utility-metrics |
0.4.1 |
Utility metrics for tabular data |
2024-12-10 13:24:13 |